Sequence Analysis on a 216-Processor Beowulf Cluster

Authors

  • Katerina Michalickova
  • Moyez Dharsee
  • Christopher W. V. Hogue
Abstract

In this work we describe the implementation of a 216-processor Beowulf cluster with switched gigabit Ethernet networking. The design uses an 8-CPU high-performance midrange computer with 8 gigabit ports as the cluster head, which limits I/O contention. We have been developing application software for bioinformatics research in protein folding, as well as the MoBiDiCK system for managing cluster applications, which is extensible to general-purpose distributed computing. In addition to the cluster architecture, we present a new cluster application for bioinformatics: MOBLAST, a variant of the BLAST family of sequence comparison programs. MOBLAST performs the BLAST algorithm exhaustively, avoiding BLAST's initial heuristic approach to finding hits. This effectively slows BLAST down to approach the speed of other comprehensive search methods such as Smith-Waterman alignment, so MOBLAST requires a sizeable cluster to run. We describe the development of MOBLAST and its use in building an exhaustive M×N database of alignments, where M is the set of protein sequences with known 3-D structures and N is the set of all protein sequences. This M×N database of protein alignments will facilitate further research in protein folding, the ultimate aim of our work with Beowulf cluster technology. Furthermore, we describe a general algorithm for partitioning M×N problems and implement it in the MoBiDiCK computing model.
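The abstract does not spell out the partitioning algorithm, so the following is only a minimal sketch of one common way to split an M×N comparison problem into independent work units: tiling the M×N grid into rectangular blocks. It is not the authors' MoBiDiCK scheme; the function name `partition_mxn` and the block-size parameters are hypothetical and chosen for illustration.

```python
from itertools import product


def partition_mxn(m, n, block_m, block_n):
    """Tile an m-by-n grid of pairwise comparisons into rectangular work units.

    Hypothetical illustration only: each unit is ((row_start, row_end),
    (col_start, col_end)) with half-open ranges, so a worker handling a unit
    would compare query sequences [row_start, row_end) against database
    sequences [col_start, col_end).
    """
    if m < 0 or n < 0:
        raise ValueError("m and n must be non-negative")
    if block_m <= 0 or block_n <= 0:
        raise ValueError("block sizes must be positive")
    # Cut each axis into consecutive chunks; the last chunk may be shorter.
    row_ranges = [(i, min(i + block_m, m)) for i in range(0, m, block_m)]
    col_ranges = [(j, min(j + block_n, n)) for j in range(0, n, block_n)]
    # Every (row chunk, column chunk) pair is one independent work unit.
    return list(product(row_ranges, col_ranges))


# Example: 5 query sequences against 7 database sequences, in 2-by-3 tiles.
units = partition_mxn(5, 7, 2, 3)
pairs_covered = sum((r1 - r0) * (c1 - c0) for (r0, r1), (c0, c1) in units)
```

Because the tiles are disjoint and together cover the whole grid, each of the M×N pairs is assigned to exactly one unit, and units can be handed to cluster nodes in any order. The block sizes trade scheduling overhead against load balance.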


Similar resources

Making A Beowulf Cluster

The Scientific Computation Research Center (SCOREC) was established jointly by the Schools of Engineering and Science at Rensselaer Polytechnic Institute. SCOREC models real world physical reactions by simulating them on computers. The numerical solution of these models often requires discretizations with a number of unknowns approaching one billion. Problems of this scale can only be solved on...


Analysis of a prototype intelligent network interface

With a focus on commodity PC systems, Beowulf clusters traditionally lack the cutting edge network architectures, memory subsystems, and processor technologies found in their more expensive supercomputer counterparts. Many users find that what Beowulf clusters lack in technology, they more than make up for with their significant cost advantage. In this paper, an architectural extension that add...


Massively Parallel Solutions for Molecular Sequence Analysis

In this paper we present new approaches to high performance protein database scanning on two novel massively parallel architectures to gain supercomputer power at low cost. The first architecture is built around a Beowulf PC cluster linked by a high-speed network and fine-grained parallel Systola 1024 processor boards connected to each node. The second architecture is the Fuzion 150, a new paral...


Simulations of Three-Dimensional Detonations by the CE/SE Method Using a Very Low-Cost Beowulf Cluster

In this paper, we report the experience of calculating three-dimensional detonations by the Space-...


Beowulf Analysis Symbolic INterface BASIN: Interactive Parallel Data Analysis for Everyone

The advent of affordable parallel computers such as Beowulf PC clusters and, more recently, of multi-core PCs has been highly beneficial for a large number of scientists and smaller institutions that might not otherwise have access to substantial computing facilities. However, there has not been an analogous progress in the development and dissemination of parallel software: scientists need the...




Publication date: 2000